Load balancing is the process of distributing incoming network traffic across multiple targets, such as EC2 instances, containers, or IP addresses, to improve application availability, fault tolerance, and scalability [citation:1].
Load balancing is the process of efficiently distributing incoming network traffic across a group of backend servers, also known as a target group. Its primary goals are to prevent any single server from becoming overwhelmed, to ensure high availability by rerouting traffic if a server fails, and to enable seamless scaling as application demand grows.